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AID FOR COMPOSING WORDS OF SONG 



This application is based on Japanese Patent Application No. 
2000-123965. filed on April 25. 2000. the entire contents of which are 
5 incorporated herein by reference. 



BACKGROUND OF THE INVENTION 

a) FIELD OF THE INVENTION 

The present invention relates to techniques of aiding a work of 
1 0 composing words of a song. 

b) DESCRIPTION OF THE RELATED ART 

As a conventional word composing apparatus, a word composition 
aiding apparatus is known which is disclosed in JP-A-1 0-97529. This 
apparatus divides music into a plurality of paragraphs, sets a word composing 

15 condition to each paragraph, derives words matching the word composing 

condition from a "poem piece data bank" to present them to a user, and stores 
the word selected by the user to compose words. The apparatus also detects 
the number of syllables of a given melody so that words matching the number of 
syllables are derived from the "poem piece data bank". 

20 With this conventional apparatus, a correspondence between 

each note of a given melody and a character of song words is not so easy to be 
understood. In deriving words matching the number of syllables of a melody, it 
is desired to consider a "phonetic sound" of each character to derive more 
general song words. For example, a word "^^/u" is pronounced either as two . 

25 notes of "7i^" and "A/" or as a single note of ";0^A/". If words having the number 
of characters equal to the number of syllables are derived, only the words 
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corresponding to the two tones in this example are derived and words 
corresponding to one tone cannot be derived. 

It is very difficult for a songwriter novice to understand what part of 
speech is placed at which position to compose good song words. It is effective 
5 for aiding a songwriter in such a point. 



SUMMARY OF THE INVENTION 

It is an object of the present invention to provide techniques of 
providing an effective aid in composing song words matching a melody. 

10 According to one aspect of the present invention, there is - 

provided a word composing apparatus comprising: a melody display controller 
for visually displaying a given melody on a display connected to the word 
composing apparatus; an input cell display controller for displaying an input cell 
corresponding to each note of the melody on the display; and an input 

1 5 processor for executing a process of inputting letters or characters of a song 
word corresponding to a note to the input cell. 

A continuous character display controller may be provided which 
displays a continuous character train in a continuous display area of the display, 
wherein characters input to the input cells are displayed in the continuous 

20 display area as a continuous song word character train. 

A continuous character train includes a train of long song words 
displayed in rows and a train containing spaces intentionally inserted. The 
continuous character train has continuous characters in at least two adjacent 
input cells. 

25 According to another aspect of the present invention, there is 

provided a word composing apparatus comprising: a dictionary storing at least 
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words and parts of speech corresponding to the words; a search condition 
designator for designating at least a part of speech as a search condition; a 
word search unit for searching a word matching the search condition designated 
by the search condition designator from the dictionary; a search result display 
5 unit for displaying a searched word; and an input processor unit for executing a 
process of inputting a word displayed on the search result display unit as letters 
or characters of a song word. 

According to various features of the invention, the following 
advantages can be obtained: 
10 a relation between each note of a melody, a character of song 

words, and a syllable can be easily understood; 

continuous words separated in respective input cells can be 
displayed as a continuous character train or syllable train; 

words of a continuous character train or syllable train can be 
1 5 made easy to look; 

input cells still not input with song word characters or syllables can 
be found easily; 

a word can be selected by designating a part of speech; 

a word can be selected by designating the number of syllables 
20 and a part of speech; 

a plurality of characters or syllables can be made to have a 
correspondence to one note; 

an order of parts of speech can be determined simply by selecting 
a sentence syntax template; 
25 each section of melody can be made easy to have a 

correspondence to a part of speech; and 
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an effective aid in composing words nnatching a melody can be 

provided. 

BRIEF DESCRIPTION OF THE DRAWINGS 
5 Figs. 1 A and 1 B are plan views showing examples of a word 

composing screen according to a first embodiment of the invention. 

Fig. 2 is a block diagram showing the main part of the word 
composing apparatus of the first embodiment. 

Fig. 3 is a diagram illustrating an input operation of inputting a 
10 character or the like in an input cell according to the first embodiment. 

Fig. 4 is a diagram illustrating an input operation of inputting a 
display control command according to the first embodiment. 

Fig. 5 is a block diagram of the word composing apparatus 
utilizing a personal computer and software according to an embodiment of the 
15 invention. 

Fig. 6 is a flow chart illustrating a main routine of a word 
composing program according to the first embodiment. 

Fig. 7 is a flow chart illustrating a word input/edit process 
according the first embodiment. 
20 Fig. 8 is a block diagram showing a main part of a second 

embodiment. 

Fig. 9 is a table showing an example of the contents of a 
dictionary according to the second embodiment. 

Figs. 10A and 10B are diagrams showing input display screens 
25 corresponding to a search condition designating unit according to the second 
embodiment. 
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Fig. 1 1 is a diagram showing an example of search results in a 
search result list box according to the second embodiment. 

Fig. 12 is a table showing another example of a dictionary 
according to the second embodiment. 
5 Fig. 13 is a block diagram showing a main part of a third 

embodiment. 

Fig. 14 is a diagram showing an example of a sentence syntax 
template database according to the third embodiment. 

Fig. 15 is a table showing an example of a vocabulary database 
10 according to the third embodiment. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Embodiments of the invention will be described with reference to 
the accompanying drawings, 

15 Fig. 5 is a block diagram showing the structure of a word 

composing apparatus constituted of a personal computer and software 
according to an embodiment of the invention. The personal computer has a 
CPU 1 . a ROM 2, a RAM 3, a timer 4, a display 5. a mouse 6, a keyboard 7, a 
digital signal processor (DSP) 8, a D/A converter (DAC) 9, a sound system 10. 

20 an external storage device 1 1 . a MIDI interface 12, a communication interface 
13 and a bus 14, DSP 8. DAC 9 and MIDI interface 12 are mounted on a 
sound card or the like. The display 5, mouse 6, keyboard 7 and external 
storage device 1 1 have interfaces. 

CPU 1 runs on an operating system (OS) installed, for example, in 

25 a hard disk drive (HDD) of the external storage device 1 1 by using a working 
area of RAM 3 to execute a normal control operation. Specifically. CPU 1 
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executes a display control of the display 5, inputs data in response to an 
operation of the mouse 6 and keyboard 7, controls the display position of the 
mouse pointer on the display 5, and detects a click operation of the mouse 6. 
In this manner, an input setting operation by a user can be performed by a 
5 so-called graphical user interface (GUI) process with the display screen of the 
display 5 and the operation of the mouse 6. 

When melody data is reproduced, CPU 1 executes an interrupt 
process in response to an interrupt signal from the timer 4 to reproduce tone 
data in the melody data or supply data of each performance to a tone generator 
10 in DSP 8. DSP 8 generates digital musical tone signals corresponding to the - 
data supplied from CPU 1 and supplies them to DAC 9. DAC 9 converts the 
digital musical tone signal into an analog audio signal which is supplied to the 
sound system 10 including an amplifier, a speaker and the like to generate 
musical sounds. 

15 The external storage device 1 1 may be one or a plurality of 

devices including a hard disk drive (HDD), a floppy disk drive (FDD), a CD-ROM 
drive, a magneto optical (MO) disc drive, a digital versatile disk (DVD) drive. 
For example, the external storage device 1 1 supplies CPU 1 with melody data 
and a word composing program. The external storage device 1 1 is also used 

20 for storing composed word data and databases for dictionaries and sentence 
syntax templates. The MIDI interface 12 transfers various data to and from 
another MIDI instrument 15. For example, the MIDI interface 12 receives 
melody data supplied from the MIDI instrument 15 and outputs composed word 
data to the MIDI instrument 15. 

25 The word composing apparatus can be connected via the 

communication interface 13 to a communication network 16 such as a local 
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area network (LAN), the internet and a telephone line so that it can receive from 
a server computer 17 various data such as a word composing program, melody 
data, dictionary data, and sentence syntax template data. For example, the 
data including the word composing program, melody data, dictionary data, and 
5 sentence syntax template data is stored in the hard disk drive (HDD) of the 
external storage device 1 1 . CPU 1 develops the word composing program 
stored in the hard disk drive (HDD) upon RAM 3 and executes the program in 
RAM 3 to control a word composing process. 

10 (1st Embodiment) 

Fig, 2 is a block diagram showing the main part of the word 
composing apparatus according to the first embodiment. An input designating 
unit A includes the mouse 6 and keyboard 7. A word input/edit unit B includes 
an input area set on the display 5, a graphic RAM and the like. A word 

15 continuous display unit C includes a display area set on the display 5, a graphic 
. RAM and the like. A melody data storage unit D includes RAM 3 or the 
external storage device 1 1 for storing melody data for which words are 
composed. A melody display unit E includes a display area set on the display 
5, a graphic RAM and the like. The display 5 may be an external display. 

20 A process at each unit is executed by running CPU 1 on the word 

composing program. In accordance with the melody data stored in the melody 
data storage unit D, a melody is visually displayed on the melody display unit E. 
Specifically, note bars are displayed on a piano roll score as note display 
elements. In accordance with the melody data stored in the melody data 

25 storage unit D, the word input/edit unit B is displayed by separating it into input 
cells corresponding to respective notes. When the input designating unit A 



7 



inputs each character to the word input/edit unit B, characters input from the 
word input/edit unit B are displayed as a continuous character train on the word 
continuous display unit C. 

Figs. 1A and 1B are diagrams showing examples of a word 
5 composing display screen of the display 5. Fig, 1A shows English song words, 
and Fig. 1B shows Japanese song words. On this word composing display 
screen, a piano roll score 20 corresponding to the melody display unit E, a word 
input/edit area 30 corresponding to the word input/edit unit B, and a word 
continuous display area 40 corresponding to the word continuous display unit C 

10 are displayed. The piano roll score 20 has a laterally elongated area divided in 
an up/down direction to simulate a piano keyboard. Each area divided in the 
up/down direction corresponds to the pitch of each key of the piano, the lower 
area being a lower pitch key and the higher area being a higher pitch key. The 
horizontal direction from the left to right corresponds to a time lapse direction. 

15 A measure line LI is displayed to partition each measure in the time lapse 
direction. A tempo line L2 is displayed corresponding to a tempo position. 
Note bars 21 1, 2I2..... 21$ corresponding to notes are displayed as note display 
elements of the melody. The suffix of the reference numeral is used for 
distinguishing between a plurality of same elements. In the following 

20 description, if the elements are not necessary to be distinguished, the suffix is 
omitted. The note bar 21 indicates a location (time position) along the right/left 
direction on the screen, and the length of the note bar 21 indicates a gate time 
(reproduction continuation time) of the note. In this example, a four-four time is 
used, other times may also be used. 

25 The word input/edit area 30 is displayed just under the piano roll 

score 20 and is divided into input cells 31 1, 3I2...., 3l8 each having the length 
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corresponding to a tempo number of each note bar 21 . The length of the note 
bar 21 corresponds to the gate time. Therefore, for example, a quarter note 
does not have one tempo length (a distance between tempo lines L2). Each 
input cell 31 has a length corresponding to the length of a corresponding note in 
5 accordance with the location of each note bar 21 . For example, in Figs. 1 A 
and IB, the note bars 21 1, 2I2. 2I3. 2l6. 2I7 and 2l8 are quarter notes and the 
corresponding input cells 31i, 3I2, 3I3, 3l6. 3I7 and Sis have a one-tempo 
length, whereas the note bars 2I4 and 21 5 are half notes and the corresponding 
input cells 3I4 and 31 5 have a two-tempo length. The length of each input cell 

1 0 may be set to the gate time (length of the note bar). 

Each input cell 31 is input with a syllable(s) or character(s), or a 
command (hereinafter called a "display control command") for controlling a 
display style of words or a song word character train. 

In the example shown in Fig. 1 A, "There" is input to the input cell 

15 31i, "are" is input to the input cell 3I2. "beau" is input to the input cell 3I3, "tiful" 
is input to the input cell 31 4, "skies and an enter (return) code" are input to the 
input cell 31 5. "To" is input to the input cell 31 7, and "kyo" is input to the input 
cell 31 8. The input cell 31 e is still not input. 

The word continuous display area 40 is displayed under the word 

20 input/edit area 30 and displays a continuous syllable train in a plurality of rows 
and columns. In the example shown in Fig. 1 A, a character or letter train 
"There are beautiful skies" is displayed in the first row, the train having 
continuous syllables in the input cells 31 1 to 31 5, and the enter command inp^ut 
to the input cell 31 5 returns the first row. In the second row. no syllable is input. 

25 to the input cell 31$ so that this input cell is empty, and the syllables "To kyo" in 
the input cells 31 7 and 31 a are displayed. 
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For example, since no syllable is input to the input cell Sis, a 
non-input symbol 41 (in the example shown in Fig. 1A, □) is displayed to 
indicate that no syllable is input. By looking at the word continuous display 
area 40, it is easy to confirm a presence of an input cell 31 without no input 
5 syllable and a corresponding note bar 21. The non-input symbol is not 
displayed after the position of the enter command as in the first row. 

In the example shown in Fig. IB, "fo" is input to the input cell 31 1. 
" b" is input to the input cell 31 2, "/c and an enter (return) command" are input 
to the input cell 3I3, "TA^" is input to the input cell 3I4, is input to the input 
10 cell 3I5. "/ct" is input to the input cell 31 7. The input cells Sis and 3l8 are still - 
not input. 

The word continuous display area 40 is displayed under the word 
input/edit area 30 and displays a continuous syllable train in a plurality of rows 
and columns. In the example shown in Fig. 1B, a character or letter train "fc b 

15 fc'' is displayed in the first row, the train having continuous syllables "fc", " 
and "fz" in the input cells 31 1 to 31 3, and the enter command input to the input 
cell 3I3 returns the first row. In the second row, a character or letter train "T/u 

is displayed, the train having continuous syllables "TA^" and in the 
input cells 3I4 and 3I5, and no syllable is input to the input cell 3l6 so that this 

20 input cell is empty, and the syllable "^j:" in the input cell 31 7 is displayed. 

For example, since no syllable is input to the input cells 31 e and 
3l8, a non-input symbol 41 (in the example shown in Fig. IB, □) is displayed to 
indicate that no syllable is input. By looking at the word continuous display' 
area 40, it is easy to confirm a presence of an input cell 31 without no input 

25 syllable and a corresponding note bar 21. The non-input symbol is not 
displayed after the position of the enter command as in the first row. 

10 



Fig. 3 is a diagram illustrating an input operation of inputting a 
character or the like to the input cell 31 . As the pointer P of the mouse is 
moved to the input cell 31 and the mouse is left-clicked, an editor area 31 A for 
input only is displayed. In this state, a character is input, for example, by using 
5 the keyboard 7. This character input process can be realized by well known 
techniques such as a word processor function (e.g., Japanese language input 
function) of a personal computer. By opening the editor area 31 A and 
selecting a character or word displayed in a search result list box as in second 
and third embodiments to be described later, a character or word can be input. 

10 By operating an enter (return) key. the editor area 31 A disappears and letters or 
characters (or words) input to the editor area 31 A are input to the input cell 31 . 

Fig. 4 is a diagram illustrating an input operation of inputting a 
display control command. As the pointer P of the mouse is moved to the input 
cell 31 and the mouse is right-cticked. a command select box 31 B is displayed. 

15 In this command select box 31 B. the names (space, a center dot delimiting 
syllables, hyphen, enter and the like) of a display control command are 
displayed. The display control command is used for controlling the display 
style, not relevant to sound reproduction, of a song word character train in the 
word continuous display area (continuous display unit) 40. By using the 

20 pointer P of the mouse 6 or the cursor keys of the keyboard 7. a command in 
the command select box 31 B is selected so that the selected command is input 
to the input cell 31 . The letter(s) or character(s) or command data input 
(established) to the input cell 31 is written in the storage area of RAM 3. In" 
accordance with the stored data, a letter(s) or character(s) is displayed in the . 

25 word continuous display area 40. Specifically, when the "space" is selected, a 
space (underline or an empty area without the non-input symbol 41) is inserted 
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in the word continuous display area 40. when the "hyphen" is selected, a 
hyphen is inserted, and when the "enter" is selected, an enter is inserted. 

As the display control command is input, the display style of 
the word continuous display area 40 is changed. In this case. for the space, 
5 for the hyphen and "J" for the enter may be displayed in the input space as 
the display control command. The user can confirm such display easily. 

Fig. 6 is a flow chart illustrating a main routine of the word 
composing program according to the embodiment, and Fig. 7 is a flow chart 
illustrating a sub-routine of the word input/edit process. The control operation 

10 of CPU 1 will be described with reference to each flow chart. As the main 
routine shown in Fig. 6 starts, at Step S1 an overall display screen of the piano 
roll score 20, word continuous display area 40 and the like are displayed (the 
word Input/edit area 30 is displayed later). At Step S2 a user designates a 
melody select process including a step of reading melody data from the hard 

1 5 disk drive or the like of the external storage device 1 1 . Next, at Step S3, note 
bars 21 corresponding to the notes of the selected melody are displayed on the 
piano roll score 20. At Step S4, the word input/edit process shown in Fig. 7 is 
executed. At Step S5 other processes are executed to thereafter return to 
Step S1. The other processes include a process of saving the composed 

20 words. 

In the word input/edit process shown in Fig. 7, at Step S1 1 in 
accordance with the selected melody data, the word input/edit area 30 is 
displayed by dividing it into input cells 31 corresponding to the notes of the 
melody, and the non-input symbol ("□") 41 is displayed in each row and columa 
25 of the word input/edit display area 40. Next, at Step SI 2 it is checked whether 
there is a left click operation of the mouse on any input cell 31 . If there is no 
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left click operation, the flow branches to Step SI 6, whereas if there is a left click 
operation, the flow advances to Step S13. At Step SI 3 the editor area 31 A is 
displayed. Next, at Step S14 an input operation of inputting a letter(s) or 
character(s) is performed in the editor area 31 A. and data of the input letter(s) 
5 or character(s) is stored in the memory (e.g., RAM 3) and the input letter(s) or 
character(s) is displayed in the word continuous display area 40. Namely, the 
input letter(s) or character(s) is displayed by replacing it by the non-input 
symbol ("□") 41. At Step S15 the editor area 31A is closed to follow Step S16. 
With the above processes, an input process of inputting a letter(s) or 

10 character(s) can be performed as described with reference to Fig. 3, 

At Step S16 it is checked whether there is a right click operation of 
the mouse on any input cell 31 . If there is no right click operation, the flow 
skips to Step SI 9. whereas if there is a right click operation, the flow advances 
to Step SI 7. At Step SI 7 the command select box 31 B is opened and 

15 displayed. Next, at Step SI 8 a process of selecting the display control 

command by the user is executed, the selected display command is stored in 
the memory (e.g., RAM 3), and in accordance with the selected display control 
command, the display style of the word continuous display area 40 is controlled 
to thereafter close the command select box 31 B and advance to Step SI 9. At 

20 Step SI 9 it is checked whether there is a termination instruction of the word 
input/edit work. If there is a termination instruction, the flow returns to the main 
routine, whereas if there is no termination instruction, the flow returns to Step 
SI 2. With the above processes, an operation of inputting the display control 
command described with reference to Fig. 4 can be performed. 

25 In the above-described embodiment, when the input cell is to be 

selected, the pointer P is moved to the input cell and the left click of the mouse 
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is performed. When the succeeding input cell is to be selected, the arrow key 
or <-) or a TAB key of the keyboard may be used. By operating the enter 
key or the like in the state that the input cell is selected, the editor area 31 A may 
be opened. Further, although after the input cell is selected and the editor 
5 area 31 A is opened, a character(s) or display control command is input, the 
character(s) or display control command may be written directly in the input cell. 
If the character train becomes long, the input cell may be expanded to two 
rows. 

Also in the above embodiment, although note bars 21 are 
10 displayed on the piano roll score 20 to visually display the melody, the melody • 
may be displayed on a staff notation or a TAB score. 

(2nd Embodiment) 

Next, the second embodiment will be described. The second 

1 5 embodiment pertains to a function of aiding a user in determining a character(s) 
or word(s) of song words. A process of inputting a character(s) or word(s) 
determined by the second embodiment and a process of displaying an input 
display screen can use the processes of the first embodiment including the 
process of inputting a character(s) to the input cell and the process of displaying 

20 the piano roll score 20. word input/edit area 30 and word continuous display 
area 40. 

Fig. 8 is a block diagram showing the main part of the second 
embodiment. A search condition designation unit F and a search start 
instruction unit G are made of GUI utilizing the mouse 6 or keyboard 7 and a 
25 screen of the display 5. A search result list display unit H is a list box or the 
like displayed on the display 5. A word search unit I is a function obtained 



14 



while CPU 1 executes the word composing program of the second embodiment 
of the invention. A dictionary J is a database stored in a hard disk or the like of 
the external storage device 11. A select/input unit K is, for example, the input 
cell 31 , editor area 31 A or the like of the first embodiment. 
5 As shown in Fig. 9, the dictionary J to be searched has a number 

of sets of a word and a part of speech registered. The part of speech includes 
a noun, a pronoun, a verb, an auxiliary, verb, an adjective, an adverb, a 
preposition, a conjunction, an interjection, an adjective verb, a participal 
adjective, a postpositional auxiliary word, and the like. 

10 Figs. 10A and 10B are diagrams showing input display screens - 

corresponding to the search condition designation unit F. The input display 
screens display an input menu 51 for inputting the number of syllables or 
sounds and an input menu 52 for inputting a part of speech. The input menus 
51 and 52 display the number of syllables and the part of speech presently 

15 selected. In the usual case, only the Input menus 51 and 52 are displayed. 
When the pointer P of the mouse 6 is moved to either a pull-down switch 51a or 
52a and the mouse is clicked, a select box 52A such as shown in Fig. 10B is 
displayed. When the pointer P of the mouse 6 is moved to any one of the 
parts of speech displayed in the select box 52A and the mouse is clicked, or 

20 any one of the parts of speech is selected by using an arrow key (t or 4-) and 
the enter key. the selected part of speech is displayed in the input menu 52. 
Similarly, the pull-down menu for the input menu 51 for inputting the number of 
syllables displays "one-syllable word, two-syllable word, three-syllable word..^..". 

As the search condition including the number of syllables and the - 

25 part of speech is entered by using the search condition designation unit F in the 
manner described above, the entered search condition is sent to the word 
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search unit I. The search condition may be selected from any one of 
candidates prepared beforehand or a user may designate and input any word. 

Next, when the search start instruction unit G instructs the word 
search unit I to start a search, the word search unit I sequentially read words 
5 registered in the dictionary J, judges whether the read word matches the search 
condition, and picks up the word matching the search condition. In this pickup 
process, the number of syllables or sounds of a read word is calculated based 
upon syllable number count rules to be described later, and it is judged whether 
the counted number of syllables satisfies the search condition and whether the 

1 0 part of speech of the read word satisfies the search condition; For example, if 
the search condition is "two-syllable word and noun" as shown in Figs. 10A and 
10B. it is judged whether the read word is a two-syllable word and whether the 
part of speech field of the read word indicates a noun. In this manner, words 
of a two-syllable word and a noun are picked up. Other search conditions may 

1 5 also be applied. For example, the characteristics of a word may be used as 
the search condition, such as "human related words", "animal related words", 
"nature related words", and "place name related words" for the noun words. If 
one word can be pronounced in many ways, words capable of being 
pronounced in many ways are also checked. For words capable of liaison, 

20 words with liaison and words without liaison are also checked. 

The following five syllable number count rules are know for 
Japanese words. 

1) For a word having a small letter such as " L-^" and the small letter 

is not counted. 

25 2) For a word having "Ay" at a position except the start position such as ";6^A/" 
and " b A/", the "A^^" is counted in some case and not counted in other cases. 



3) For a word having " 9 " at a position except the start position such as " i: 9 *' 
and 9 the " 9 " is counted in some case and not counted in other cases. 

4) For a word having "fc" at a position except the start position such as ":/)^^" 
and fo". the "fc" is counted in some case and not counted in other cases. 

5 5) A prolonged sound in a katakana word is counted in some case and not 
counted in other cases. 

These syllable number count rules are applied in the manner as in 
the following specific examples. 

When a word "V^ b-^" is read, the rule 1) is applied. Since "i/^" 
10 and " U-^" are both one syllable, it is judged that the word is a two-syllable 
word. 

When a word ":d^A/7t Ay" is read, the rule 2) is applied. Since 
Ay" and "fc A/" are both one syllable or two syllables, it is judged that the word " 
;0^A/7t A/" corresponds to each of a two-syllable word, a three-syllable word and 
15 a four-syllable word. 

When a word "fcV^# if 9 " is read, the rule 3) is applied. 
A word "fcV^# <if " has each one syllable and four syllables in total. Since " 9 " 
is counted in some case and not counted in other cases, it is a zero-syllable 
word or a one-syllable word. Therefore, it is judged that the word "fc V ^ # if 9 " 

20 corresponds to each of a four-syllable word and a five-syllable word. 

When a word $ A/" is read, the rules 2) and 4) are applied. 
A syllable "fo" of is one syllable and a syllable "fo" is either zero syllable 

or one syllable according to the rule 4) so that the whole of is one 

syllable or two syllables. Similar to the explanation of ";6^Ay7t Ay", a syllable " 

25 Ay" is either one syllable or two syllables. Therefore, it is judged that the word 
";6^fc ^ Ay" corresponds to each of a two-syllable word, a three-syllable word 
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and a four-syllable word. 

When a word "T — is read, the rule 5) is applied. Since " — " 
is counted in some case and not counted in other cases, it is a zero-syllable 
word or a one-syllable word. Both "T" and "5^" are one syllable. Therefore, it 
5 is judged that the word "T — corresponds to each of a two-syllable word and 
a three-syllable word. 

The search result is sent to the search result list display unit H. 
The search result list display unit H displays a search result list box such as 
shown in Fig. 1 1 and displays searched words in this search result list box. An 

10 example shown in Fig. 1 1 corresponds to the examples described above. The 
words as the search result candidates for the two-syllable word are "T — " 
fcvv\ "V^U^", "V^;i^'\ and ";i^A/fcAy". Since the word "^i^Ayfc/u" is 

an adjective verb, the words matching the search condition of "two-syllable 
word" and "noun" are "T-5^", "fcv^", "V^ b^". "V^^i^-^ and ";^)^fc^^". In the 

15 example shown in Fig. 1 1 , although the meaning of each word is displayed, 
other terms such as synonyms, metonyms, and antonyms may be displayed. 
The dictionary J stores these meanings, synonyms, metonyms, antonyms and 
the like of each word to display them. 

In order to select a word from the search result list box shown in 

20 Fig. 1 1 . the pointer P is moved to the word and the mouse is clicked, or the 
word is selected by using the arrow key (t or i) of the keyboard and the enter 
key. In this manner, the selected word may be input to the input cell 31 or 
editor area 31 A of the first embodiment. 

In this embodiment, the number of syllables of a read word is 

25 counted based upon the syllable number count rules. Instead, as shown in a 
dictionary of Fig. 12, the number of syllables of each word may be calculated in 
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advance based upon the syllable number count rules and stored in the 
dictionary. 

(3rd Embodinnent) 

5 Next, the third ennbodiment will be described. In the third 

embodiment, melody is divided into small melody sections. By using a 
sentence syntax template, each melody section is made to have a 
correspondence with a segment, and a word to be assigned to each segment is 
searched. As a process of inputting a selected word and a process of 

10 displaying input display screens of the third embodiment, the process of 

inputting a character to the input cell 31 and a process of displaying the piano 
roll score 20, word input/edit area 30 and word continuous display area 40 of 
the first embodiment can be applied. 

Fig. 13 is a block diagram showing the main part of the third 

15 embodiment. A database L for sentence syntax templates is stored in a hard 
disk drive or the like of the external storage device 1 1 . 

Fig. 14 shows an example of sentence syntax templates. For 
English sentence syntax templates, the templates are first classified into 
fundamental sentence syntaxes. Elements of each fundamental syntax 

20 include a subject S, a verb V, a complement C, an object O and the like. An 
adverb, an interjection and the like are used for modifications. For example, 
the subject S and object O are further classified into a pronoun, an article + a 
noun, an article + an adjective + a noun" and the like. S, V, O, C and the like 
are herein called segments. 

25 For example, the second template shown in Fig. 14 is S + V + C. 

This template is further classified into a plurality of lower level templates such 
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as (pronoun) + (verb) + (noun), (pronoun) + (verb) + (adjective), and (article) + 
(noun) + (auxiliary verb) + (verb) + (adjective) + (noun). 

For Japanese sentence syntax templates, for example as shown 
in Fig. 14. a plurality of sentence syntax templates are stored, each template 
5 being constituted of a combination of a plurality of segments each 

corresponding to one part of speech. For example, the first sentence syntax 
template at the uppermost row in Fig. 14, is constituted of three segments 
including "[noun] t)^", "[noun] and "[transitive verb]". 

Referring to Fig. 13, a melody division unit N, a segment number 

10 coincidence template search unit O, a syllable number count unit Q for an 
unsettled part of speech, a sentence syntax template decision unit R, an 
unsettled part of speech search unit S and an unsettled vocabulary decision unit 
U are functions obtained when CPU 1 executes the word composing program. 
Each unit operates as in the following manner. 

15 The melody division unit N divides a selected melody M, takes a 

correspondence between each divided section and a segment to determine the 
number of segments of the melody. The melody may be divided automatically 
in accordance with a predetermined algorithm, or manually by a user. For 
example, the melody may be divided by setting a note having a long note length 

20 to the last of a section, by setting a section partition between notes having a 
large pitch difference, or by setting a section partition at a position where there 
is a change in detected chord and tempo. 

After the melody is divided, the segment number coincidence " 
template search unit O searches the database L to detect the sentence syntax . 

25 template whose segment number coincides with the melody. 

It is assumed that a melody whose words are to be written has six 
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syllables and that two templates are selected including: 

S + V + C ...(1) 

S + V + O ...(2) 
A syllable is assigned to each of the segments S, V. O and C. Assuming that 
5 S and V are determined and S and V have four syllables in total, then the 
remaining C or O is required to have two syllables. In this manner, the 
sentence segment is determined one after another, and the remaining segment 
is selected so as to coincide with the remaining number of syllables, two 
syllables. For example, if a lower level template for C or O has (article) + 
10 (adjective) + (noun), C or O cannot be realized by two syllables so that this 
template is discarded from template candidates. Therefore, a template such 
as (noun), (adjective) and (adjective) + (noun) which can be made of two 
syllables is searched for C or O. For example, adjectives and nouns having 
two syllables are searched from a vocabulary database shown in Fig. 15. In 
15 this manner, a portion of the sentence is determined and the remaining 
unsettled sentence is derived from the database to settle it. 

A specific example of a Japanese language sentence will be 
described. It is assumed that a melody whose words are to be written has 
twenty six syllables and that this melody is divided into six sections, i.e., six 
20 segments as indicated by the following formula (1): 

[5 syllables]/[3 syllables]/[4 syllables]/[5 syllables]/[4 syllables]/[5 
syllables] ... (1) 

It is then assumed that the following sentence syntax templates 
(a) to (c) having six segments are selected from the sentence syntax template . 
25 database L: 

[adjectivel/(noun] t /[adjective]/[nounl ^> /[noun] ^ /[transitive 
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verb] ... (a) 

[adjective]/[noun] /[noun] ^ /[adjective]/[noun] "C /[transitive 

verb] ...(b) 

[adjective]/[nounl /[noun] $r /[adjective]/[noun] (D^^iz 
5 /[transitive verb] • • (c) 

In the templates (a) to (c). '7* indicates a delimiter for a segment, 

and "[ ]" indicates a part of speech having the arbitrary number of syllables. 

"t'\ "75^" and the like are a postpositional auxiliary word whose syllable number 

is counted in. 

10 The syllable number count unit Q for an unsettled part of speech - 

calculates the number of syllables of each part of speech in each segment of 
each sentence syntax template candidate (a) to (c) in accordance with the 
melody segment condition of the formula (1), i.e., the number of syllables of 
each part of speech. For example, in the case of the template (a), the 

15 [adjective] of the first segment is five syllables and the [noun] in the second 
segment is two syllables (three syllables - one syllable) because one syllable " 
t " is added to the second segment. The [noun] in the fourth segment is four 
syllables (five syllables - one syllable) because one syllable is added to the 
fourth segment. Similarly, the number of syllables of each part of speech is 

20 calculated for the other templates. 

Next, the sentence syntax template decision unit R discards the 
template not satisfying the syllable number condition from the template 
candidates. For example, the number of template candidates is reduced by 
comparing the number of syllables of each part of speech calculated by the 

25 syllable number count unit Q for an unsettled part of speech with the number of 
syllables in each segment of the melody. Namely, the fifth segment of the 
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template (c) has already four syllables of the postpositional auxiliary word of 
X 0 i^" which is equal to four syllables of the fifth segment of the melody shown 
in the formula (1). Since the syllable number count unit Q for an unsettled part 
of speech counts as a zero syllable the number of syllables of the [noun] in the 
5 fifth segment of the template (c), this template (c) is discarded. Therefore, the 
templates (a) and (b) are left. One of the templates (a) and (b) are selected by 
a user or by using a random number. It is herein assumed that the template 
(a) is selected. 

The vocabulary database T is stored in a hard disk or the like of 

10 the external storage device 1 1 and stores a number of registered sets of a word 
and a part of speech. The unsettled part of speech search unit S selects from 
the vocabulary database T a part of speech designated by the selected 
template and having the syllables equal in number to the calculated number. 
For example, for the first segment, an adjective having five syllables (refer to 

15 the formula (1)) is selected from the vocabulary database T shown in Fig. 15. 
For example, two words "'i'i'S^h bv>" and " 5 o < bv^" are selected as 
candidate words. In counting the number of syllables of each word in the 
vocabulary database T, the syllable number count rules of the second 
embodiment are applied. 

20 The unsettled vocabulary decision unit U displays the candidate 

words selected as described above on the display 5. for example, in a search 
result list box. One word matching the melody is selected from the candidate 
words and decided as a melody word. This selection/decision is performed by 
a user or by using a random number similar to the template selection. If there . 

25 is a database for relevant words, a target word can be selected based upon 
words before and after the target word. 
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A function of selecting again "modifier word" + "noun" or "modifier 
word" + "verb" as one word for an unsettled part of speech of a noun and a verb 
in the template. For example, if a template is [noun] (i [noun] [transitive 
verb] and one modifier word is added to the noun and verb, the template is 
5 changed to [[adjective][noun]] []adjective][noun]] ^ [transitive verb]. 

For example, if some section (segment) is assigned a [noun], a 
relatively long noun coincident with the number of syllables may be selected or 
the [noun] is further divided into [adjective] and [noun] to select the words 
having the same number of syllables. 

1 0 The function of reducing the number of candidate parts of speech 

can be improved further if accents of both English and Japanese words are 
registered in the vocabulary database, and when a part of speech is searched 
from the vocabulary database, the characteristics of a melody are judged to 
select a word having an accent matching the characteristics from the 

1 5 vocabulary database. 

If there is melody sections having the same melody role among a 
plurality of melody sections, the same template is used for such melody 
sections. In this case, if parts of speech in one melody section have already 
been selected, the words of these parts of speech are preferentially selected as 

20 the words in another melody section having the same role. For example, if a 
melody has a measure structure of A1-B1-C1-A2-B2-C2, the same template is 
used for A1 and A2. If the first section (segment) of A1 is determined, 
candidate words for the first section of A2 are narrowed down to those words 
related to those selected for A1 . 

25 The above embodiments are constituted of a personal computer 

and software. The above embodiments may be applied to electronic musical 
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instruments. The word composing program may be stored in a ROM. A tone 
generator, a sequencer, an effector and the like may be discrete devices which 
are interconnected by communication means such as MIDI and various 
networks. 

5 The format of melody data may be any format such as an "event + 

relative time" format which defines an occurrence time of a performance event 
by a time from one previous event, an "event + absolute time" format which 
defines an occurrence time of a performance event by an absolute time in 
music or in a measure, a "pitch (rest) + note length" format which defines 

10 performance data by a pitch and length of a note or by a rest and rest length. - 
and a "direct storage" format which reserves a memory area corresponding to 
each minimum performance resolution and stores each performance event in 
the memory area corresponding to the occurrence time of a performance event. 
In the embodiments, the word composing program is stored 

15 beforehand in the hard disk. The invention is not limited thereto, but the word 
composing program may be recorded in a CD-ROM or CD-R and loaded in a 
hard disk. CPU 1 develops this word composing program upon RAM 3 and in 
accordance with the program read from RAM 3. performs the word composing 
process in a similar manner to the embodiments. In this manner, CPU 1 can 

20 perform similar operations to those when the word composing program is stored 
in ROM 2. With this arrangement, new installation, addition, version-up and 
the like of the word composing program can be made easily. The word 
composing program may be recorded in a floppy disk, a magneto optical (MO) 
or the like and supplied to RAM 3 or a hard disk. 

25 The word composing program may be downloaded by using the 

communication interface 13. In this case, for example, the communication 
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interface 13 is used for the connection to the communication network 16 such 
as a local area network (LAN), the Intemet and a telephone line. The word 
composing program is distributed from the server computer 17 via the 
communication network and recorded in a hard disk to complete the download 
5 A MIDI interface is not limited only to a dedicated MIDI interface, 

but it may be configured by using a general interface such as RS-232C. a 
universal serial interface (SUB), IEEE1394, In this case, data different from 
MIDI messages may be transmitted and received at the same time. 

The present invention has been described in connection with the 
10 preferred embodiments. The invention is not limited only to the above 
embodiments. It is apparent that various modifications, improvements, 
combinations, and the like can be made by those skilled in the art. 
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